An Effective Fuzzy Clustering of Crime Reports Embedded by a Universal Sentence Encoder Model
Abstract
Clustering crime reports is crucial for identifying and preventing criminal activities that occur frequently in society. In the proposed work, named entities in a report are recognized to extract crime-related phrases, which are subsequently preprocessed by applying stopword removal and lemmatization operations. Next, a module of the universal sentence encoder model, called the transformer, is applied to get a sentence embedding for each associated sentence; aggregating these embeddings finally provides the vector representation of the report. An innovative and efficient graph-based algorithm consisting of splitting and merging operations has been proposed to cluster the crime reports. The algorithm generates overlapping clusters, which indicates the existence of reports covering multiple crime types. Fuzzy theory is used to give each report a score expressing its membership in the different clusters; accordingly, reports are labelled with multiple categories. The efficiency of the proposed method is assessed by taking into account different datasets and comparing the results with other state-of-the-art approaches with the help of various performance metrics.
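The last two steps of the abstract (aggregating sentence embeddings into a report vector, then scoring the report's membership in several clusters) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the abstract does not state the aggregation rule or the membership formula, so the sketch uses a plain mean and cosine similarities normalized to sum to 1. The toy 3-dimensional vectors, the cluster names, and the `threshold` parameter are hypothetical, and real embeddings would come from an actual Universal Sentence Encoder model.

```python
import math


def mean_vector(vectors):
    """Aggregate sentence embeddings into one report vector by averaging.

    Assumption: the abstract only says "aggregation"; a mean is one simple choice.
    """
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def fuzzy_memberships(report_vec, centroids):
    """Return a membership score per cluster; the scores sum to 1.

    Assumption: membership is the non-negative cosine similarity to each
    cluster centroid, normalized over all clusters. This is NOT the paper's
    formula, which the abstract does not give.
    """
    sims = {name: max(cosine(report_vec, c), 0.0) for name, c in centroids.items()}
    total = sum(sims.values())
    if total == 0:
        # No similarity to any cluster: fall back to uniform membership.
        return {name: 1.0 / len(sims) for name in sims}
    return {name: s / total for name, s in sims.items()}


def labels(memberships, threshold=0.3):
    """Assign every category whose membership reaches the (hypothetical) threshold."""
    return sorted(name for name, m in memberships.items() if m >= threshold)


# Toy data: two sentence embeddings of one report and three cluster centroids.
sentence_embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
centroids = {
    "burglary": [1.0, 0.0, 0.0],
    "assault": [0.0, 1.0, 0.0],
    "fraud": [0.0, 0.0, 1.0],
}

report = mean_vector(sentence_embeddings)
memberships = fuzzy_memberships(report, centroids)
report_labels = labels(memberships)
print(memberships)
print(report_labels)
```

In this toy case the report sits midway between two centroids, so it receives equal membership in both and is labelled with both categories, which mirrors the overlapping clusters the abstract describes.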
Similar resources
Universal Sentence Encoder
We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity...
Merging Duplicate Bug Reports by Sentence Clustering
Duplicate bug reports are undesirable because identifying them, marking them as duplicates, and eventually discarding them takes many person-hours. During this time no progress is made on the program in question, an overhead that should be minimized. Considerable research has been carried out to alleviate this problem. Many methods have been proposed for bug report cate...
OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM
This paper presents an efficient hybrid method, namely fuzzy particle swarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzy clustering problem, especially for large sizes. When the problem becomes large, the FCM algorithm may result in uneven distribution of data, making it difficult to find an optimal solution in a reasonable amount of time. The PSO algorithm does find a go...
Optimal Sentence Clustering Using An Innovative Hierarchical Fuzzy Clustering Algorithm
Data clustering plays an essential role in many text processing activities, and much work is under way in this area because of its wide range of applications. Sentence clustering is more challenging than other kinds of data clustering, because a sentence can express the same idea in different ways. For example, some people see a glass as half empty and others see it as half full. Due to this...
A Tripartite Model of EFL Teachers' Attributions, Burnout, and Self-Regulation: Towards the Prospects of Effective Teaching
The present study proposes a model for effective English language teaching. The model draws on three factors that influence the teaching effectiveness of English language teachers: the attributional styles, self-regulation, and burnout of Iranian EFL teachers. The dissertation is designed in four phases: the first phase covers the design and validation of an attributional styles questionnaire for English language teachers, and the second phase covers the use of this questionnaire...
Journal
Journal title: Mathematics
Year: 2023
ISSN: 2227-7390
DOI: https://doi.org/10.3390/math11030611